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04 . Abstract. We show that every sufBciently large oriented graph G with 5^{G),5 (G) > 

}_( ' ^""'^ contains a Hamilton cycle. This is best possible and solves a problem of Thomassen 

Ph! from 1979. 

<: 
o : 

^ ' 1. Introduction 

i A central topic in graph theory is that of giving conditions under which a graph is Hamil- 

' tonian. One such result is the classical theorem of Dirac [7j, which states that any graph on 

• . n > 3 vertices with minimum degree at least n/2 contains a Hamilton cycle. For an analogue 

I in directed graphs it is natural to consider the minimum semi-degree 6^{G) of a digraph G, 

2 ' which is the minimum of its minimum outdegree 6+{G) and its minimum indegree 5 (G). 

. The corresponding result is a theorem of Ghouila-Houri [8], which states that any digraph 

I on n vertices with minimum semi-degree at least n/2 contains a Hamilton cycle. (When 

■ referring to paths and cycles in directed graphs we always mean that these are directed, 
^ [ without mentioning this explicitly.) Both of these results are best possible. 

q!^ ' In 1979 Thomassen |18j raised the natural corresponding question of determining the 

ff^ . minimum semi-degree that forces a Hamilton cycle in an oriented graph (i.e. in a directed 

I graph that can be obtained from a (simple) undirected graph by orienting its edges). Over 

7—i • the years since the question was posed, a series of improving bounds were obtained in 

O , [191 ISni El Unj • Recently, Kelly, Kiihn and Osthus [12] were able to obtain an approximate 

I solution. They proved that an oriented graph on n vertices with minimum semi-degree at 

■ least (3/8 + o(l))n has a Hamilton cycle, which asymptotically matches a lower bound given 
I by Haggkvist [9]. In this paper we obtain the following result which exactly matches the 

^ ' lower bound of Haggkvist, thus answering Thomassen's question for large oriented graphs. 

H ; 

5t , Theorem 1. There exists a number uq so that any oriented graph G on n > uq vertices 

with minimum semi-degree 5'^{G) > [ ^"g"^ ] contains a Hamilton cycle. 

Note that Theorem [1] implies that every sufficiently large regular tournament on n vertices 
contains at least n/8 edge-disjoint Hamilton cycles. (To verify this, note that in a regular 
tournament, all in- and outdegrees are equal to (n — l)/2. We can then greedily remove 
Hamilton cycles as long as the degrees satisfy the condition in Theorem [TJ) This is a slight 
improvement on the (1/8 + o(l))n bound obtained in [12j by the same argument. It is 
the best bound so far towards the classical conjecture of Kelly (see e.g. [3]), which states 
that every regular tournament on n vertices can be partitioned into (n — l)/2 edge-disjoint 
Hamilton cycles. 

Haggkvist [9] also made the following conjecture which is closely related to Theorem [TJ 
Given an oriented graph G, let d{G) denote the minimum degree of G (i.e. the minimum 
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number of edges incident to a vertex) and set 6*{G) := 6{G) + S~^{G) + 6~{G). Haggkvist 
conjectured that if 6*{G) > (3n — 3)/2, then G has a Hamilton cycle. (Note that this 
conjecture does not quite imply Theorem [1] as it results in a marginally greater minimum 
semi-degree condition.) In [12j, this conjecture was verified approximately, i.e. if 6*{G) > 
(3/2 + o(l))n, then G has a Hamilton cycle. It seems possible that our approach can be 
extended to obtain an exact solution to this problem, but this would certainly require some 
additional ideas beyond those applied here. 

Our argument can be extended to find a cycle of any length £, with i > (say) 
through any given vertex. We indicate the necessary modifications to the proof in the final 
paragraph, the details can be found in [11]. This result is used in [13] to obtain the following 
pancyclicity result: any sufficiently large oriented graph G with 5^{G) > (3n — 4)/8 contains 
a cycle of length i for all ^ = 3, . . . , n. For ^ = 4, . . . , n, it even contains a cycle of length i 
through a given vertex. 

The rest of this paper is organized as follows. The next section contains some basic 
notation. In Section 3 we describe the extremal example showing that Theorem [1] is best 
possible. We set up our main tools in Section 4, these being a digraph form of the Regularity 
Lemma due to Alon and Shapira [1] and the Blow-up Lemma, both in the original form of 
Komlos, Sarkozy and Szemeredi [15] and in a recent stronger and more technical form due 
to Csaba [5]. Our argument uses the stability method, and so falls naturally into two cases, 
according to whether or not our given oriented graph G is structurally similar to the extremal 
example described in Section 3. In Section 5 we prove a lemma that enables us to find a 
Hamilton cycle when G is not structurally similar to the extremal example. The argument 
in this case is based on that of [12]. Then we prove our main theorem in the final section. 

2. Notation 

Given two vertices x and y of a directed graph G, we write xy for the edge directed 
from X to y. The order \G\ of G is the number of its vertices. We write Nq{x) for the 
outneighbourhood of a vertex x and d'Q{x) := |A''^(x)| for its outdegree. Similarly, we 
write Nq{x) for the inneighbourhood of x and d^ix) := |iV^(x)| for its indegree. We write 
Ng{x) := N^{x) U N^{x) for the neighbourhood of x and dcix) := \Ng{x)\ for its degree. 
We use N^{x) etc. whenever this is unambiguous. We write A(G) for the maximum of 
|A^(2;)| over all vertices x G G. Given a set A of vertices of G, we write Nq{A) for the set of 
all outneighbours of vertices in A. So Nq{A) is the union of N^{a) over all a £ A. Nq{A) 
is defined similarly. The directed subgraph of G induced by A is denoted by G[A] and we 
write E{A) for the set of its edges and put e{A) := \E{A)\. If 5 is a subset of the vertex set 
of G then G — S denotes the digraph obtained from G by deleting S and all edges incident 
to S. 

Recall that when referring to paths and cycles in directed graphs we always mean that 
they are directed without mentioning this explicitly. Given two vertices x and ?/ on a directed 
cycle G, we write xGy for the subpath of G from x to y. Similarly, given two vertices x and y 
on a directed path P such that x precedes y, we write xPy for the subpath of P from x to y. 
A walk in a directed graph G is a sequence of (not necessarily distinct) vertices vi,V2, ■ ■ ■ ,V£ 
where fjUi+i is an edge for all 1 < i < ^. The walk is closed if vi = vi. A 1- factor of G is a 
collection of disjoint cycles which cover all the vertices of G. We define things similarly for 
undirected graphs. 

Given two vertices x, y of G, an x-y path is a directed path which joins x to y. Given two 
disjoint subsets A and B of vertices of G, an A-B edge is an edge ah where a G A and h £ B. 
We write E{A, B) for the set of all these edges and put e{A, B) := \E{A, B)\. We sometimes 
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Figure 1. The oriented graph in the proof of Proposition [2l 



also write edA, B) for e[A^ B). We denote by {A, B)g the oriented bipartite subgraph of G 
whose vertex classes are A and B and whose edge set is E(A, B). 

We call an orientation of a complete graph a tournament and an orientation of a com- 
plete bipartite graph a bipartite tournament. Throughout the paper we omit floors and 
ceilings whenever this does not affect the argument. For a positive integer k we write 
[k] :={!,..., k}. 

3. The extremal example 

In this section, we show that the bound in Theorem [T] is best possible for every n. The 
construction is due to Haggkvist [9]. However, he only gives a description for the case when 
n is of the form 8k + 7. Below, we also add the remaining cases to give the complete picture. 

Proposition 2. For any n > 3 there is an oriented graph G on n vertices with minimum 
semi-degree \{3n — 4)/8] — 1 which does not contain a 1-factor, and so does not contain a 
Hamilton cycle. 

Proof. We construct G as follows. It has n vertices partitioned into 4 parts A, B,C, D, 
with \B\ > \D\. Each of A and G spans a tournament, B and D are joined by a bipartite 
tournament, and we add all edges from A to B, from B to C, from C to -D and from D to 
A (see Figure 1). Since every path which joins two vertices in B has to pass through D, it 
follows that every cycle contains at least as many vertices from D as it contains from B. As 
\B\ > \D\ this means that one cannot cover all the vertices of G by disjoint cycles, i.e. G 
does not contain a 1-factor. 

It remains to show that the sizes oi A, B,G, D and the tournaments can be chosen to give 
minimum semi-degree S^{G) = [(3n — 4)/8] — 1. The following table gives possible values, 
according to the value of n mod 8: 



n 


[(3n-4)/8] - 1 


\A\ 


\B\ 


\G\ 


\D\ 


8k - 1 


3k -1 


2k - 1 


2k + I 


2k - 1 


2k 


8k 


3k -I 


2k 


2k + I 


2k - 1 


2k 


8k + 1 


3k -1 


2k 


2k + 1 


2k 


2k 


8k + 2 


3k 


2k 


2k + 2 


2k - 1 


2k + 1 


8k + 3 


3k 


2k 


2k + 2 


2k 


2k + 1 


8k + 4 


3k 


2k + I 


2k + 2 


2k 


2k + 1 


8k + 5 


3k + I 


2k + 1 


2k + 2 


2k + 1 


2k + 1 


8k + 6 


3k + I 


2k + 2 


2k + 2 


2k + 1 


2k + I 
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We choose the tournaments inside A and C to be as regular as possible (meaning that the 
indegree and outdegree of every vertex differ by at most 1): this can be achieved (e.g.) by 
arranging the vertices evenly around a circle and directing edges so that the larger part of the 
circle lies to the right of each edge, breaking ties arbitrarily. For the bipartite tournament 
between B and D there are two cases. Firstly, when \B\ = 2k + 1 and \D\ = 2k we choose it 
so that \N+{b) nD\ = \N-{b) n D\ = k for every 6 e S and {\N+{d) n B\,\N-{d) n B\} = 
{k, k + 1} (in either order) for every d £ D: this can be achieved by the blow-up of a 4-cycle, 
i.e., write B = Bi U B2, D = Di U D2 with \Bi\ = k + I, \B2\ = \Di\ = |L»2| = k and 
direct edges from Bi to Z?i, from Di to B2, from B2 to D2 and from D2 to Bi. Secondly, 
when \B\ = 2k + 2 and \D\ = 2k + 1 we choose the bipartite tournament between B 
and D so that \N~^{b) f] D\ = k + I and \N^{b) n D\ = k for every b e B and so that 
for every d £ D either \N+{d) D B\ = |iV~(d) D B\ = k + 1 or \N+{d) H B\ = k and 
\N-{d) r\B\ = k + 2. This can be achieved by labelling B as [2k + 2], D as Z/{2k + 1)Z 
and setting N+{b) n D := {{k + 1)6, (/c + 1)6 + 1, . . . , (A: + 1)6 + k} for b £ B. Indeed, to 
check that the vertices in D have the correct indegrees note that N~^{b) R D is a segment of 
consecutive vertices of D for each b € B and the segment A^+(6-|- 1)0 D starts immediately 
after iV+(6) n D and ends with the first vertex of 7V+(6) n D. 

To verify the minimum semi-degree condition we introduce the notation 6g,6g for the 
minimum in- and outdegrees of vertices in 5" G {A,B,C,D}. We can eliminate some of 
the checking by noting that we have always chosen |C| < \A\ and \D\ < \B\, so we have 
inequalities 6^ < 6^^ < 6^ and 5 J < 6^ as well as 6^ ^ ^d- In the cases n = 8k — l,8k,8k + l 
we also have 6^ < 6^ and Sg < S^^, so 

5\G) = mm{6+,6+} = mm{\C\ + [\D\/2\ , [i\C\ - l)/2j + \D\} = 3k - 1. 

In the case n = 8k + 2 we have 6^ = \C\ + {k + 1) = 3k, 6]^ = \A\ + k = 3k, 6q = 
[(|C| - 1)/2J + \D\ = 3k and 6]^ = \ A\ + k = 3k, so 6^{G) = 3k. The constructions for the 
cases n = 8k + 3,8k + 4: are obtained from n = 8k + 2 hy increasing \ A\ and/or \C\ by 1, 
and we still have 5^{G) = 6+ = 3k. When n = 8k + 5 we have (5+ = |C| + {k + l) = 3k + 2, 
6- = \A\+k = 3k + 1, 6^ = [{\C\ - 1)/2J +\D\ = 3k + 1 and = \A\ + k = 3k + 1, 
so 5^{G) = 3k + 1. Finally, the construction for the case n = 8k + 6 is obtained from 
n = 8k + 5 hy increasing \A\ by 1 and we still have 6^{G) = 6q = 3k + 1. In all cases we 
have d^{G) = \{3n - 4)/8] - 1, as required. □ 

Remark. One may add any number of edges that either go from yl to C or lie within D 
without creating a 1-factor (although this does not increase the minimum semi-degree). 

4. The Diregularity Lemma, the Blow-up Lemma and other tools 

In this section we collect all the information we need about the Diregularity Lemma and 
the Blow-up Lemma. See |16j for a survey on the Regularity Lemma and [T3j for a survey on 
the Blow-up Lemma. We start with some more notation. The density of a bipartite graph 
G = {A, B) with vertex classes A and B is defined to be 

eG{A,B) 



dG{A,B) 



\A\\B\ 



We often write d{A, B) if this is unambiguous. Given e > 0, we say that G is e-regular if 
for all subsets X <^ A and Y <^ B with \X\ > e\A\ and \Y\ > e\B\ we have that \d{X,Y) - 
d{A,B)\ < £. Given d € [0,1] we say that G is {£,d)- super-regular if it is e-regular and 
furthermore dcia) > {d - £)\B\ for all a G ^ and ddb) > {d - e)\A\ for all b G B. (This 
is a slight variation of the standard definition of {e, d)-super-regularity where one requires 
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dcio-) > d\B\ and dcib) > d\A\.) Given partitions Vq, Vi, . . . , 14 and Uq, Ui, ■ ■ ■ ,U£ of the 
vertex set of some graph, we say that Vo,Vi, . . . ,Vk refines Uo,Ui, . . . ,Ui if for all Vi with 
i > 1 there is some Uj with j > which contains Vi. Note that Vq need not be contained in 
any Uj, so this is weaker than the usual notion of refinement of partitions. 

The Diregularity Lemma is a version of the Regularity Lemma for digraphs due to Alon 
and Shapira [Ij. Its proof is quite similar to the undirected version. We will use the degree 
form of the Diregularity Lemma which can be easily derived (see e.g. [21] for an explicit 
proof of all but the 'refinement property') from the standard version, in exactly the same 
manner as the undirected degree form. 

Lemma 3 (Degree form of the Diregularity Lemma). For every e S (0,1) and all num- 
bers M' , M" there are numbers M and uq such that if 

• G is a digraph on n > uq vertices, 

• Uq, . . . , Um" is a partition of the vertices of G, 

• d £ [0, 1] is any real number, 

then there is a partition of the vertices of G into Vq, Vi, . . . ,Vk and a spanning subdigraph G' 
of G such that the following holds: 

• M' <k<M, 

• l^ol <en, 

• l^il = • • • = \Vk\ =: m, 

• ^Oi Vi, . . . ,Vk refines the partition Uq, Ui, . . . ,Um" , 

• d'Qi{x) > dQ{x) — {d + e)n for all vertices x £ G, 

• d^,{x) > dQ^x) — {d + e)n for all vertices x £ G, 

• for alii = \,. . . ,k the digraph G'\Vi\ is empty, 

• for all 1 < i, i < k with i ^ j the bipartite graph whose vertex classes are Vi and Vj 
and whose edges are all the Vi-Vj edges in G' is e-regular and has density either or 
density at least d. 

Vi,...,Vk are called clusters, Vq is called the exceptional set and the vertices in Vq are 
called exceptional vertices. The last condition of the lemma says that all pairs of clusters 
are e-regular in both directions (but possibly with different densities). We call the spanning 
digraph G' C G given by the Diregularity Lemma the pure digraph. Given clusters Vi , . . . , 
and a digraph G' , the reduced digraph R' with parameters {£,d) is the digraph whose vertex 
set is [k] and in which ij is an edge if and only if the bipartite graph whose vertex classes 
are Vi and Vj and whose edges are all the Vi-Vj edges in G' is e-regular and has density at 
least d. (So if G' is the pure digraph, then ij is an edge in R' if and only if there is a Vi-Vj 
edge in G' .) 

It is easy to see that the reduced digraph R' obtained from the regularity lemma 'inherits' 
the minimum degree of G, in that 5+{R')/\R'\ > 5+ {G) /\G\ - d - 2e and 5-{R')/\R'\ > 
6~ {G)/\G\ — d — 2£. However, R' is not necessarily oriented even if the original digraph G 
is. The next lemma (which is essentially from |12) ) shows that by discarding edges with 
appropriate probabilities one can go over to a reduced oriented graph R Q R' which still 
inherits the minimum degree and density of G. 

Lemma 4. For every £ £ (0, 1) and there exist numbers M' = M'(e) and uq = no(e) such 
that the following holds. Let d £ [0, 1] with e < d/2, let G be an oriented graph of order 
n > Uq and let R' be the reduced digraph with parameters (e, d) obtained by applying the 
Diregularity Lemma to G with M' as the lower bound on the number of clusters. Then R' 
has a spanning oriented subgraph R such that 

(a) 6+iR)>iS+iG)/\G\-{3e + d))\R\, 
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(b) 5-{R) > i5-{G)/\G\-{3e + d))\R\, 

(c) for all disjoint sets S,T C V{R) with ec{S*,T*) > "idv? we have eji{S,T) > d\R\'^, 
where S* := IJ.es ^* — UeT ^i- 

(d) for every set S C V{R) with ec{S*) > Sdn"^ we have eR{S) > d\R\'^, where S* := 

Proof. Statements (a) and (b) are proven in [12] by considering the following probabilistic 
argument. Let R be the spanning oriented subgraph obtained from R' by deleting edges 
randomly as follows. For every unordered pair Vi,Vj of clusters we delete the edge ij (if it 
lies in R') with probability 

ea{V,,Vi) 

^' eG'{VuVj) + eG>(yj,Viy 

Otherwise we delete ji (if it lies in R'). We interpret ([T]) as if ij, j« ^ E{R'). So if R' 
contains at most one of the edges ij, ji then we do nothing. We do this for all unordered pairs 
of clusters independently. In [12] it was shown that R satisfies (a) and (b) with probability 
at least 3/4. For (c), note that 

^ ea{S',T')-(d + E)n\S\m ^ 3 j 

~ 2 

A Chernoff-type bound (see e.g. [21 Cor. A. 14]) now implies that there exists an absolute 
constant c such that 

FieRiS, T) < d\R\^) < P i\eRiS, T) - £(6^(5, r))| > E(efi(5, r)/3)) 

< g-cE(e_R(S,T)) < g-cd|i?|2^ 

But the number of pairs {S, T) of disjoint subsets of V{R) is at most (21^')^, so the probability 
that R does not satisfy (c) is at most 2^1-^1 e"'^'^'-^' < 1/4. (Here we used that \R\ > M' is 
sufficiently large.) The proof for (d) is similar. Altogether, this implies that there must be 
some outcome which satisfies (a)-(d). □ 

The oriented graph R given by Lemma [4] is called the reduced oriented graph with param- 
eters {e,d). The spanning oriented subgraph G* of the pure digraph G' obtained by deleting 
all the Vi-Vj edges whenever ij € E{R') \ E[R) is called the pure oriented graph. Given an 
oriented subgraph S R, the oriented subgraph of G* corresponding to S is the oriented 
subgraph obtained from G* by deleting all those vertices that lie in clusters not belonging 
to S as well as deleting all the Vi-Vj edges for all pairs Vi, Vj with ij ^ E{S). 

In our proof of Theorem [1] we will also need the Blow-up Lemma of Komlos, Sarkozy and 
Szemeredi |15| . It implies that dense regular pairs behave like complete bipartite graphs 
with respect to containing bounded degree graphs as subgraphs. 

Lemma 5 (Blow-up Lemma, Komlos, Sarkozy and Szemeredi [H]). Given a graph F on 
[k] and positive numbers d, A, there is a positive real r/o = iloid, A, k) such that the following 
holds for all positive numbers £i, . . . ,ik o-nd all < t] < ijq. Let F' be the graph obtained 
from F by replacing each vertex i £ F with a set Vi of ii new vertices and joining all vertices 
in Vi to all vertices in Vj whenever ij is an edge of F. Let G' be a spanning subgraph of F' 
such that for every edge ij G F the graph (V^, V^)^' is {7],d) -super-regular. Then G' contains 
a copy of every subgraph H of F' with A{H) < A. Moreover, this copy of H in G' maps the 
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vertices of H to the same sets Vi as the copy of H in F' , i.e. if h G V{H) is mapped to Vi 
by the copy of H in F' , then it is also mapped to Vi by the copy of H in G' . 

The 'moreover' part of the statement is not part of the usual statement of the Blow-up 
Lemma but is stated explicitly in its proof. 

We will also need to use the following stronger and more technical version due to Csaba [5] . 
(The case when A = 3 of this is implicit in [6].) It removes the dependency of the regularity 
constant rj on the number k of clusters. Also, it does not demand super-regularity. The latter 
is replaced by conditions (C6),(C8) and (C9) below. Moreover, it is explicitly formulated 
to allow for a set Vq of exceptional vertices in the host graph G' which are not part of 
any regular pairs. For this to work, one first has to find a suitable partition LQ,...,Lk 
with |Lo| = |Vo| of the graph H which we are aiming to embed and a suitable bijection 
(j) : Lq ^ Vq. The embedding of H into G' guaranteed by the Blow-up Lemma is then found 
by a randomized algorithm which first embeds each vertex x E Lq to (pix) G Vq and then 
successively embeds the remaining vertices of H. Condition (CI) requires that there are 
not too many exceptional vertices and (C2) ensures that we can embed the vertices in Lq 
without affecting the neighbourhood of other such vertices. Li will be embedded into the 
cluster Vi of G', so we clearly need to assume (C3). Condition (C5) gives us a reasonably 
large set D of 'buffer vertices' which will be embedded last by the randomized algorithm. 
(C7) ensures that the exceptional vertices have large degree in all 'neighbouring clusters'. 
(C8) and (C9) allow us to embed those vertices whose set of candidate images in G' has 
grown very small at some point of the algorithm. 

In the statement of Lemma [6] and later on we write < oi ^ 02 ^ as to mean that we 
can choose the constants 01,02,03 from right to left. More precisely, there are increasing 
functions / and g such that, given 03, whenever we choose some 02 < 7(03) and oi < 5(02), 
all calculations needed in the proof of Lemma [6] are valid. Hierarchies with more constants 
are defined in the obvious way. 

Lemma 6 (Blow-up Lemma, Csaba |5]). For all numbers A, Ki, K2, and every positive 
constant c there exists an number N such that whenever e,e' ,6' ,d are positive constants with 

0<e<^e'<^6'<^d<^ 1/A, 1/Ki,l/K2, 1/Ks,c 

the following holds. Suppose that G' is a graph of order n > N and Vq, . . . ,Vk is a partition 
ofV{G') such that the bipartite graph {Vi,Vj)G' is e-regular with density either or d for 
all 1 < i < j < k. Let H be a graph on n vertices with A{H) < A and let Lq, Li, . . . , be 
a partition ofV{H) with |Lj| = \ Vi\ =: m for every i = 1, . . . ,k. Furthermore, suppose that 
there exists a bijection </> : Lq — > Vq and a set I C V{H) of vertices at distance at least 4 
from each other such that the following conditions hold: 

(CI) \Lo\ = \Vo\ < Kidn. 
(C2) Lo C /. 

(C3) Li is independent for every i = 1, . . . ,k. 
(C4) \Nh{Lo) n Li\ < K2dm for every i = 1,. . . ,k. 

(C5) For each i = 1,. . . ,k there exists Di C I n Li with \Di\ = 5'm and such that for 
D := \Ji^i Di and all 1 < i < j < k 

WNniD) n L,| - iNniD) n Lj\\ < em. 

(C6) If xy £ E{H) and x £ Li,y E Lj then iVi,Vj)G' is e-regular with density d. 
(C7) Ifxy e E{H) and x £ Lo,y e Lj then \NG'i(l)ix)) n > cm. 
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(C8) For each i = 1, . . . , A;, given any C. Vi with \Ei\ < e'm there exists a set Fi C 
{Li n (I \ D)) and a bijection (pi : Ei ^ Fi such that \Ngi{v) H Vj| > (d — e)m 
whenever NH{4>i{v)) H Lj ^ ^ (for all v £ Ei and all j = 1, . . . , k). 
(C9) Writing F := U(Li we have that \Nh{F) n Li\ < K^e'm. 
Then G' contains a copy of H such that the image of Li is Vi for all i = 1, . . . ,k and the 
image of each x £ Lq is 4>{x) G Vq. 

The additional properties of the copy of H in G' are not included in the statement of the 
lemma in but are stated explicitly in the proof. 

We aim to apply the Blow-up Lemma with G' being obtained from the underlying graph of 
the pure oriented graph. In order to satisfy (C8), it will suffice to ensure that all the edges 
of a suitable 1-factor in the reduced oriented graph R correspond to (e, d) -super-regular 
pairs of clusters. A simple calculation implies that this can be ensured by removing a small 
proportion of vertices from each cluster Vi, and so (C8) will be satisfied. However, (C6) 
requires all the edges of R to correspond to e-regular pairs of density precisely d and not 
just at least d. (Although, as remarked by Csaba [5], it actually suffices that the densities are 
close to d in terms of e.) This is guaranteed by the second part of the following proposition 
from [E]. 

Proposition 7. Let M',no,D be positive numbers and let £,d be positive reals such that 
I/uq <^ 1/M' <C e <C d <C 1/D. Let G be an oriented graph of order at least uq. Let R 
be the reduced oriented graph with parameters (e, d) and let G* be the pure oriented graph 
obtained by successively applying first the Diregularity Lemma with e, d and M' to G and 
then Lemma\^ Let S be an oriented subgraph of R with A (5) < D. Let G' be the underlying 
graph of G* . Then one can delete 2De\Vi\ vertices from each cluster Vi to obtain subclusters 
Vi C Vi in such a way that G' contains a subgraph G'g whose vertex set is the union of all 
the V/ and such that 

• (^/' ^j^G'g ^■^ (V^5 d — 4:De) -super-regular whenever ij £ E{S), 

• (V^', V^Ocg ^■^ y/e-regular and has density d — ADe whenever ij £ E[R). 

5. The non-extremal argument 

This section covers the main part of the argument for the case when the original oriented 
graph G is not close to being the 'extremal graph' described in Section [3l The following 
well-known fact will be very useful. 

Proposition 8. Suppose that G is a digraph such that |A^^(S')| > IS"! for every S C V{G). 
Then G has a \- factor. 

Proof. The result follows immediately by applying Hall's matching theorem to the following 
bipartite graph H: the vertex classes A,B oi H are both copies of the vertex set of the 
original digraph G and we connect a vertex a £ A to b £ B m H if there is a directed edge 
from a to 6 in G. A perfect matching in H corresponds to a 1-factor in G. □ 

Lemma 9. Suppose 1/n <C eo 1/^ <^£<^d<^c<^l, and that R is an oriented graph 
on [k] such that 

• 6^{R) > 2ck, 

• \N^{S)\ > \S\ + ck for any S C [k] with \S\ < (1 - c)k, and 

• C is a 1-factor in R. 

Also, suppose that R* is an oriented graph obtained from R by adding a set Uq of at most Eqh 
vertices and some edges so that every x £ Uq has both an inneighbour and an outneighbour 
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in [k]. Suppose G* is an oriented graph on n vertices with vertex partition Uq, Ui, . . . , U^, 
such that 

(i) each Ui is independent; 

(ii) if ij is an edge on some cycle from C, then the bipartite graph {Ui,Uj) consisting 
of all the Ui-Uj edges in G* forms an {£,d) -super-regular pair, while if ij G E{R), 
the bipartite graph (Ui,Uj) forms an e-regular pair of density d in G* , and finally 
{Ui, Uj) is empty whenever ij ^ E{R); 

(iii) for all x G C/q o^'nd i G [k], we have \NQt{x) H > c\Ui\ whenever xi G E{R*) and 
\Nq^{x) n Ui\ > c\Ui\ whenever ix G E{R*); 

(iv) \Ui\ = ■ ■ ■ = \Uk\ =: m. 

Then G* has a Hamilton cycle. 

Proof. Throughout the proof of the lemma, given a cycle C G C and a vertex x G C, we will 
denote the predecessor of x on C by x~ and its successor by x~^. We extend this convention 
to sets of vertices in the natural way. We say that a walk W in R* is balanced with respect 
to C if for each cycle C £ C every vertex on C is visited the same number of times by W. 
We will first construct a closed walk W in R* such that 

(a) W is balanced with respect to C; 

(b) W visits every vertex in [k] at least once and at most times; 

(c) W visits every vertex in Uq exactly once; 

(d) any two vertices in Uq are at distance at least 4 along W. 

To achieve the balance property (a), we will build up W from certain special walks (which 
are themselves balanced with respect to C): a C-shifted walk joining x to y and traversing t 
cycles from C is a walk of the form aiCiaj~a2C2a^ . . . atCtaJ , where x = ai, y = and the 
Ci are (not necessarily distinct) cycles from C. (Note that this definition is slightly different 
from the corresponding one in [12].) 

We claim that for any two vertices x,y £ [k] there is a C-shifted walk in R joining x to y 
which traverses at most + 1 cycles from C. To see this, let 5^ C [k] be the set of vertices 
that can be reached from x by a C-shifted walk traversing at most i cycles from C. Then 
Si = {x~} and Si^i = (N^^Si))" for i > 1. By assumption on R, if \Si\ < {1 — c)k then we 
have IS'i+il > l^il + ck. So if i > we have l^il > (1 — c)k. The minimum semi-degree 
condition on R implies that \N^{y~^)\ > 2ck. So there is a vertex z G n N^{y^) and we 
can reach y by following a C-shifted walk to z which traverses at most i cycles from C and 
then adding zy~^Cy (where C is the cycle from C containing y.) 

For each pair x,y let W{x,y) denote such a C-shifted walk. To construct W, we start at 
vertex 1 G [k] and follow the walks T^(l, 2"), 2-2, VF(2, 3"), S'S, . . . , W/(A: - l,k-). Note 
that altogether these walks yield a single (balanced) C-shifted walk joining 1 to k~ which 
visits every vertex of R at least once. Next we aim to extend this walk to incorporate the 
vertices in Uq. For this, suppose that Uq = {vi, . . . , Vq}, say. For each i £ [q], we now select 
vertices Wi G N^,{vi), Ui G N^^{vi) and follow the walks W{k,ui), uiviwi, W{wi,U2), 
U2V2W2, ■■■ , W{wq-i,Uq), UqVqWq. Finally we close the walk by following W {wq,\~) and 
the edge l"!. Note that the resulting closed walk W is balanced with respect to C since 
this holds for each of the walks W Moreover, W traverses at least one cycle of C 
between each visit to Uq (by definition), so any two vertices in Uq are at distance at least 4 
along W . Every vertex in Uq is visited exactly once, and vertices in [k] are visited at most 
{k + |C/o|)(c-^ + 1) < -/equ times, so W has the required properties (a)-(d). 

We will use the Blow-up Lemma (Lemma E]) to 'transform' W into a Hamilton cycle of G* . 
As a preliminary step, it will be useful to consider an auxiliary (undirected) graph K* which 
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it is obtained from G* by replacing every e-regular pair (Ui, Uj) of nonzero density (i.e. those 
which correspond to edges of R) by a complete bipartite graph. The neighbourhoods of the 
vertices in Uo are not affected. We can find a Hamilton cycle Cnam 

in K* as follows. First 

we find a one-to-one mapping of to a cycle W in K* such that each vertex of Uq is 
mapped to itself and for each i S [k] visits of to a vertex i G R are mapped to distinct 
vertices in Ui; to do this we first greedily select distinct images for Ui and Wi for all i G [q], 
which is possible by assumption (iii) and the inequality \Uo\ < eon < cn/{5k) < c|[/i|/4. We 
can then complete the rest of the mapping arbitrarily since W visits every vertex i € R at 
most times. Next we will extend W to a Hamilton cycle. For each cycle C € C, let 

771(7 denote the number of times that W visits a vertex c € C (note that this number is the 
same for all vertices c G C). Fix one particular occasion on which W 'winds around' C and 
replace the corresponding path in W' by a path Pq with the same endpoints that winds 
around the 'blow-up' Ucgc of C in K* exactly m — mc + 1 times, so that it exhausts 
all the vertices in UceC ^c- Since C is a 1-factor the resulting cycle Cnam uses all vertices 
in K* , i.e. it is Hamiltonian. For future reference we note that the number of times that Pc 
winds around UceC Uc is m — mc + 1 > m — > m — ^/£Q2mk > (1 — e)m. (Here we 

used property (b) and ^ 1/A;,e.) 

Now we use the version of the Blow-up Lemma by Csaba (Lemma [6| to show that Cnam 
corresponds to a Hamilton cycle in G* . We fix additional constants e',5' with e <C e' <C 
5' <C d. We will apply the Blow-up Lemma with H being the underlying graph of Cnam 
and G' being obtained from the underlying graph of G* as follows: if xi G E[R*) with 
X G f/o and i G [k], then delete all those edges which correspond to edges oriented from Ui 
to x; similarly, if ix G E{R*) with x G C/q and i G [fc], then delete all those edges which 
correspond to edges oriented from x to Ui. This deletion ensures that when we embed 

am as an undirected cycle in G' the directions of the available edges will in fact make 
it a directed cycle in G* . For all i > 0, Ui will play the role of Vi in the Blow-up Lemma 
and we take Lq, Li, . . . , to be the partition of H induced by Vq, Vi, . . . , Vk- (j) : Lq Vq 
will be the obvious bijection (i.e. the identity). To define the set / C V{H) of vertices of 
distance at least 4 from each other which is used in the Blow-up Lemma, for each C G C 
let P[j be the subpath of H corresponding to the path Pc (defined in the construction of 
Cnam)- Note that |P^| = \Pc\ > (1 - e)m\C\. For each i = 1, . . . , fc, let Cj G C denote the 
cycle containing i and let Jj C L, consist of all those vertices in Lj n V{Pl-,,) which have 
distance at least 4 from the endvertices of P^^ . Thus in the graph H each vertex u £ Ji 
has one of its neighbours in the set corresponding to the predecessor of i on Cj and its 
other neighbour in the set Lf corresponding to the successor of i on Cj. Moreover, all the 
vertices in Jj have distance at least 4 from all the vertices in Lq and the lower bound on 
I Pel implies that | Jj| > 9m/10. It is easy to see that one can greedily choose a set /j C Jj 
of size m/10 such that the vertices in IJ(Li have distance at least 4 from each other. Let 
/ := Lo U Uti li. 

Let us now check conditions (C1)-(C9). (CI) holds with Ki := 1 since |Lo| = \Uo\ < 
Eon < dn. (C2) holds by definition of /. (C3) holds since H \s a. Hamilton cycle in K* and 
K* inherits the independence property (i) in the statement of the lemma from G* . This 
also implies that for every edge xy G H with x G Li,y G Lj > 1) we must have that 
ij G E{R) (provided that the direction of xy was chosen such that xy G Cnam)- Thus (C6) 
holds as every edge of R corresponds to an e-regular pair of clusters having density d. (C4) 
holds with K2 ■= 1 because 

\Nh{Lo) n Li\ < 2\Lo\ = 2\Uo\ < 2eon < dm. 
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(The final inequality follows from the fact that Eq <C l/k,d.) For (C5) we need to find a set 
L> C I of buffer vertices. Pick any set Di C /j with \Di\ = 5'm and let D := |Ji=i -^i- Since 
/j C Jj we have that |iVj^(D) Ci Lj\ = 25'm for all j = 1, . . . , k. Hence 

\\NH{D)nL,\-\NH{D)nLj\\ = 

for all 1 < i < j < and so (C5) holds. (C7) holds by assumption (iii) on R* and the fact 
that our walk W used only edges of R* . 

(C8) and (C9) are now the only conditions we need to check. Given a set Ei CVi = Ui of 
size at most e'm, we wish to find Fi C (Lj H (/ \ D)) = Ii \ D and a bijection (pi : Ei ^ Fi 
such that every v £ Ei has a large number of neighbours in every cluster Vj for which Lj 
contains a neighbour of (f)i{v). Pick any set -Fj C /j \ D of size \Ei\. (This can be done since 
\D n Ii\ = 5'm and so |/j \ > m/10 — 5'm ^ e'm.) Let : Ei ^ Fi be an arbitrary 
bijection. To see that (C8) holds with these choices, consider any vertex v & Ei Vi and 
let j be such that Lj contains a neighbour of 4>i{v) in H. Since (t>i{v) £ Fi C li (1 Ji^ this 
means that j must be a neighbour of i on the cycle Ci € C containing i. But this implies 
that \Nq/{v) CiVjl > {d — e)m since each edge of the union UcgC C R oi all the cycles 
from C corresponds to an (e, (i)-super-regular pair in G' . 

Finally, writing F := IJi=i have 

\NH{F)nLi\ < 2e'm 

(since Fj C Jj for each j = 1, . . . ,k) and so (C9) is satisfied with := 2. Hence (C1)-(C9) 
hold and so we can apply the Blow-up Lemma to obtain a Hamilton cycle in G' such that 
the image of Lj is Vi for alH = 1, . . . , fc and the image of each x £ Lq is (j){x) G Vq = Uq. By 
construction of G' this undirected Hamilton cycle corresponds to a directed Hamilton cycle 
in G*. □ 

6. Proof of Theorem □ 

Define M' , Mq G N and additional constants so that 

1/no < 1/M^ < £0 < c?o < <e<c?<c<r/«;l. 

Suppose G is an oriented graph on n > no vertices with minimum semi-degree 5'^{G) > ^"g~^ 
and no Hamilton cycle. Apply the Diregularity Lemma (Lemma [3|) to G with parameters 
d, M' to obtain a partition Vq, Vi, . . . , of V{G) with A; > M'. Let i? be the reduced 
oriented graph with parameters (e^/3, d) given by applying LemmaSland let G* be the pure 
oriented graph. Lemma [Dj^a) and (b) implies 

(2) 5^{R) > (3/8 - l/(2n) -d- e'^)k > (3/8 - 2d)k. 

We divide the remainder of the argument into two cases, according to whether there is a 
set S C [k] with k/3 < \S\ < 2k/3 and \N^{S)\ < \S\ + 2ck. 

Case 1. |iV^(5)| > |5| + 2ck for every S C [k] with k/3 < \S\ < 2k/3. 

Note that if / |5| < A:/3 then |iV+(S)| > 5+{R) > \S\+d^k (say), andif |S| > 2A;/3then 
l-^l + I^^WI > k for any i G [k], i.e. S D N^{{) / 0, and 7V+(5) = [k]. So |iV^(5)| > |5| 
for every S G [k], and Proposition [8] implies that there is a 1-factor C in R. In fact, 

(3) \N+{S)\>\S\+d'k 

for every S C [k] with d^k < \S\ < (1 - d'^)k. 

We would now like to apply Lemma [9] to R, C and G* to find a Hamilton cycle. However, 
the size of the exceptional set Vq may be larger than the number of exceptional vertices 
allowed for by the lemma. To deal with this problem, we first partition V{G) \ Vq into sets A 
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and B. Then we further refine the above 'regularity partition' within Vq to obtain a very 
small 'exceptional set' Aq. It then turns out that we can use Lemma [9] to find a Hamilton 
path in U Vq) \ Aq and a Hamilton cycle in B U Aq which together form a Hamilton cycle 
in G*. 

By adding at most k vertices to Vq we may assume that each Vi contains an even number 
of vertices. 

Claim 1.1. There is a partition of V \Vo into sets A and B which has the following 
properties: 

• \Ai\, \Bi\ = ^\Vi\, where we write Ai := ViCi A and Bi := Vi Ci B for every i € [A;]; 

• \N^ix) n Ai\,\N^{x) n Bi\ = ^\N^{x) nVi\ ± r?!"^ for every vertex x £ G; and 
similarly for Nq(x); 

• R is the oriented reduced graph with parameters (e^, 3d/ A) corresponding to the parti- 
tion Ai, . . . , Ak of the vertex set of G* [A] ; this also holds for the partition Bi, . . . , Bf^. 

Proof. For each cluster Vi consider a random partition of Vi into two sets Ai and Bi obtained 
by assigning a vertex x £ Vi to Ai with probability 1/2 independently of all other vertices 
in Vi. So A := l^i^i Ai and B := |Ji=i ^i- Then the probability that = is at least 
l/(3Y^|Vi|) (see e.g. j!, p. 6]). Also, standard Chernoff bounds imply that the probability 
that there is a vertex x G G whose degree in Ai is too large or too small is exponentially 
small in |Vi|. So with non-zero probability we obtain a partition as desired. To see the third 
property, note that the definition of regularity implies that the pair (Ai, Aj)G* consisting of 
all the Ai-Aj edges in G* is e^-regular and has density at least 3d/4 whenever ij £ E(R). 
On the other hand, {Ai,Aj)G* is empty whenever ij ^ E{R) since (Vi,Vj)G* ^ {Ai,Aj)G* is 
empty in this case. □ 

Apply Proposition [71 with G*[i3] playing the role of G*, C playing the role of S (so D = 2), 
the Bi playing the role of the Vi and e,d replaced by e^,3(i/4. We obtain an (oriented) 
subgraph G'^ of and sets B'- C Bi of size 4e^|i?j| so that a pair {B[, B'-)qi^ is e-regular 

of density 3d/4 — > d/2 whenever ij G E{R), and so that {B[,B'-)qi is (e, (i/2)-super- 
regular whenever ij is an edge of C. Let B' := B'^\J ■ ■ ■ \J B'f^. So V{G'^) = B'. Let Vq be 
obtained from Vq by adding the vertices in Bi \ B[ for all i G [k\. Since |Vo| < (e^/3)n + k 
and \Bi\B'^^ < 4:e^\Bi\ we have 

1^0 1 < 

Claim 1.2. Each set Ai (i S [k]) contains a set Ui of size 2e\Ai\ such that 5^{G*[A\U]) > 
dn/8, where U := IJi=i ^i- 

Proof. Consider any cluster Ai. Claim 1.1 implies that for all j G -^/j (0 most e^|^i| 
vertices in Ai have less than d\Aj\/2 outneighbours in Aj. Call these vertices useless for j. 
So on average any vertex of Ai is useless for at most e'^\N^{i)\ indices j G N^{i). Thus at 
most e\Ai\ vertices in Ai are useless for more than e\N^{i)\ indices j G N^ii). Let C/| C Ai 
be a set of size £\Ai\ which contains all these vertices and some extra vertices from Ai if 
necessary. Similarly, there is a set U" C Ai\Ul of size £\Ai\ such that every vertex v G Ai\U[' 
has at least d|Aj|/2 inneighbours in Aj for all but at most e\N^{i)\ indices j G N^{i). Let 
Ui := U'i U U'i and U := U!Li Ui. Thus \U\ = 2e\A\ and every vertex v £ Ai\Ui satisfies 

d-^a'[A\u]^^)>2^^-^)\KmAi\-\U\ > -. 
The same bound holds for d^,^^^^^{v) and thus 5^{G*[A \ U]) > dn/8. U 
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Add all the vertices in U to Vq to obtain a new set Vq with 

|Fo"l < 3en. 

Let be the subgraph of G obtained from G*[A \ U\ by adding the set Vq and setting the 
out- and inneighbourhoods of any v S Vq in equal to NQ{v)r\{A\U) and NQ{v)'r\{A\U). 
Thus 5^{G'^) > d\G'^\/8 by Claims 1.1 and 1.2. In what follows, we will still write Ai for 
Ai \ Ui and A for A\U. Then in G'^ each pair (Ai, Aj)Qi^ with ij £ E{R) is still e-regular 
and has density at least d/2. 

Apply the Diregularity Lemma (Lemma [3]) to with parameters Eg, 2do, Mq and ini- 
tial partition Vq , Ai, . . . , A^ to obtain a reduced digraph R'^ and a partition Aq, A[, . . . , A'^ 
(where the exceptional set is A'q) which refines Vq , Ai, . . . , Aj.. Apply Lemma[l]to (play- 
ing the role of G), i?^ (playing the role of R!) and e^, 2do, Mq (playing the roles of e, d, M'). 
We obtain a reduced oriented graph Ra with parameters (sq, 2(io) such that 

5^{Ra) > d£/8 - {3el + 2do)e > d£/W 

(this is what we need Claim 1.2 for). Let G^ denote the corresponding pure oriented graph. 

Claim 1.3. For any T C [£] with 2d^i < \T\ < (1 - 2d'^)l we have \N+^{T)\ > \T\ + d^£/2. 

Proof. Let m := \Ai\ = \A\/k, for 1 < i < k, and m' := \A'^\, for 1 < x < £. Then 
1^1 + \V^'\ = im' + \A'q\. Since IV;,"! < Sen we get m' < i\A\ + 3en)/£ < (1 + 7e)\A\/£, and 
since |^^| < el{\A\ + IFq"!) < egn we get m' > (|^| -eln)/£ > (1 - 3eg)| For i £ [k] let 
Li := {x G [^] : A'^ C ^J. Then 

'^^'-m'- km' - km' < H (1 " 3eg)| A| J < + 

Let Z := {i £ [k] : |Lj| < (1 — 10e)£/k}. We will show that Z is comparatively small. For 
this, note that if i G Z, then 

\Ai \ A'q\ = \Li\m' < (1 - We){e/k){l + 7e)\A\/e < (1 - e)\A\/k. 

Since \Ai\ = m = \A\/k, this in turn shows that \Ai D A'q\ > e\A\/k. So \Z\ > ek would 
imply that |^q| > \Z\e\A\/k > e^|A| > EqU, contradiction. Thus we must have \Z\ < ek. 

Let Ti := T n Li, S := {i £ [k]: \Ti\ > d^£/k} and T' := Ues^i- Then \Ti\ < \Li\ < 
(1 + 3el)£/k and \T \ T'\ < {k - \S\)dH/k + \{x £ [£] : A'^^ C Vf^'}\ < {d^ + 7e)£ imply that 

(4) |5|>^^-^>|r|./^-2d»t. 

Therefore \S\ > d^k and so ^ implies that either \S\ > (1 - d^)k or \N+{S)\ > \S\ + d^k. 
Suppose first that the latter holds. We will now show that for all i G and all j £ N^{i) 
we have 

(5) \L,\N+jT)\<d'£/k. 

Indeed, if not then Wi := UxGTi ^^'x ^-nd Wj := U^gl \Af+ (T) ^'x ^ would span a 

bipartite subgraph {Wi, Wj)Qi^ of {Ai, Aj)Qi^ with vertex class sizes \Wi\, \Wj\ > \A'^\d^£/k > 
em. Note that eRj^{Ti,Lj \ N^^{T)) = 0. So Lemma [H (c) applied to Tj (playing the role 
of S) and Lj \ (T) (playing the role of T) shows that 

cg' {W^,Wj) < 6(io|G^p < edon^ < 7do{mkf < < 7|VFi||VF,|, 

A £ 4 
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where the last inequahty holds since ^ e and thus also do <C 1/k. This contradicts the 
fact that {Ai, Aj)Qi^ is an e-regular pair of density at least d/2, which proves ([5]). Therefore 

I^A.(^)I> E m-dH/k)>{\N+{S)\-\Z\){l-2d')i/k 
> {\S\ + d'^k - ek){l - 2d^)£/k § |T| + d^e/2. 



The argument in the case when \S\ > (1 — 
beginning of Case 1 we now have N^{S) = 
(1 - 3d^)i > \T\ + d'^i/2. 



d?')k is almost the same: as mentioned at the 
[k] and so |A^^^(T)| > (A: - ek){l - 2d^)i/k > 

□ 



Now the same argument that we used to find the 1-factor C in i? gives us a 1-factor 
Ca in Ra- Apply Proposition [71 with G'^ playing the role of G*, Ca playing the role of S 
(so D = 2), the A'- playing the role of the Vi and £,d replaced by eQ,2do. We obtain an 
(oriented) subgraph G'^ of G'^ and sets A'- C A'- so that a pair {A'l,A'-)Qm is eg-regular of 
density 2dQ — Se'^ > do whenever ij G E{Ra), and so that {A'-, A'-)qiii is {eq, do) -super-regular 
whenever ij is an edge of Ca- Let A' := A'(U - - - UA'^ and let Aq be the set obtained from A'q 
by adding ah the vertices in A- \ A'^ for ah i £ [£]. Then V{G'^) = A' and 

1^0 1 < eln + Aeln < eon. 

Let G*^ be the subgraph of G obtained from G'^ by adding the set Aq and setting the out- 
and inneighbourhoods of any v G Aq in G*^ equal to Nq{v) n B' and Nq{v) H B'. We will 
apply Lemma [9] to find a Hamilton cycle in G^. For this, note that the reduced oriented 
graph R defined above satisfies the initial two conditions of Lemma[9] (c.f. the discussion just 
before the statement of Claim 1.1). Moreover, we will let the vertex partition B[, . . . , B'f^ 
of G*^ — Aq play the role of Ui, . . . ,Uk in the lemma. The role of Uq will be played by Aq. 
Now define an oriented graph R*^ with vertex set [k] U Aq as follows. The restriction of i?^ 
to [k] is R. For each x G Aq add an edge from x to i G [k] if \Nq{x) H B'-\ > c\Bl\ and an 
edge from i G [k] to x if \N^{x) r\ B'^\ > c|i?'|. Using Claim 1.1, it is easy to see that x has 
indegree and outdegree at least (3/8 — ^fc)k. However there may be double edges, so delete 
edges to obtain an oriented graph (which we still call i?^) in which each x G A!q has indegree 
and outdegree at least (3/16 — ^/c)k > k/10 (say, although we only need these degrees to be 
non-zero). Now applying Lemma to the oriented graphs R, R*^ (playing the role of R*) 
and G*^ (playing the role of G*) gives a directed Hamilton cycle Cb in G*^. 

Choose any vertex v of G*^ and let be the vertex that succeeds v on Cb- Let G\ be the 
digraph obtained from G'^ by adding a new vertex which has the following neighbours: 
the outneighbourhood of v^, is Nq{v) n A' and the inneighbourhood is Nq{v^) n A'. Let 
R\ be the digraph obtained from Ra by adding a new vertex and edges as follows. 
Add an edge from to z G [i] if \Nf^{v) H yl^| > c\A'-\ and an edge from i G [i] to if 
\N^{v'^) n A'^\ > c\A[\. Using Claim 1.1 it is easy to see that has indegree and outdegree 
at least (3/8 — ^/c)i. Again, we can delete edges to arrive at an oriented graph i?^ in which 

has indegree and outdegree at least (3/16 — \/c)£ > £/10. Now Lemma [9] applied to Ra 
(playing the role of R), Ca (playing the role of C), R\ (playing the role of i?*), G\ (playing 
the role of G*) and with 1/|G^|, eo,do,d'^ /2 playing the roles of So, k, e, d, c gives a directed 
Hamilton cycle Ca in C^. Note that this yields a Hamilton path in G'^ which starts in 
and ends in v~ , where and v~ are the successor and predecessor of on Ca- Now 
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V^Cbw^Cav^V^ is a Hamilton cycle in G* and thus in G. This contradiction completes 
the analysis of Case 1. 

Note that the argument in the proof of Case 1 implies the following lemma, which will be 
used in [17] to prove an approximate analogue of Chvatal's theorem on hamiltonian degree 
sequences for digraphs. 

Lemma 10. Let M',no be positive numbers and let £,d,r],u,T be positive constants such 
that 1/no <^ 1/M' <Ce<Cc?<CJ^<T<Cr?<l. Let G be an oriented graph on n > no 
vertices such that 6^{G) > 2r/n. Let R' be the reduced digraph of G with parameters {£,d) 
and such that \R'\ > M' . Suppose that there exists a spanning oriented subgraph R of R' 
with 5^{R) > 7]\R\ and such that \N+{S)\ > \S\ + u\S\ for all sets S C V{R) with t\R\ < 
\S\ < {1 — t)\R\. Then G contains a Hamilton cycle. 

Let us now continue with the second case of the proof of Theorem [TJ 

Case 2. There is a set S C [k] with k/3 < \S\ < 2k/3 and \N^{S)\ < \S\ + 2ck. 

The strategy in this case is as follows: by exploiting the minimum semi-degree condition, 
we will first show that G has roughly a similar structure as the extremal example described 
in Section [3l In a sequence of further claims, we will then either find a Hamilton cycle or 
obtain further structural information on G which means that it must be even more similar to 
the extremal example. Eventually, we arrive at a contradiction, since being (almost) exactly 
like the extremal example is incompatible with the minimum semi-degree condition. Unless 
stated otherwise, all neighbourhoods, degrees and numbers of edges refer to the oriented 
graph G from now on. Let 

An -.= 3 N+{S), Br := N+iS) \ S, Cr := [k] \ (S U N+iS)), Dr:=S\ N+iS). 

Let A := UieAi; ^ define B,G,D similarly. By definition we have eR{AR,GR) = 
^r{Ar, Dr) = eR{DR,GR) = eR{DR) = 0. Since R has parameters (e^/3, d), Lemma|Hc,d) 
implies that we have 

(6) e{A, C), e{A, D), e{D, C), e{D) < 3dn'^. 

Prom now on we will not calculate explicit constants multiplying c, and just write 0(c). The 
constants implicit in the 0(*) notation will always be absolute. 

Claim 2.1. \A\,\B\,\C\,\D\ = (1/4 ± 0(c))n. 

Proof. First we show that Ar, Br, Cr and Dr are non-empty. Since the average value of 
\N^{x) n 5| over all x ^ S is less than |S'|/2, we have 

11 

\Br\ > 6+{R) - \S\/2 > (3/8 - 2d)k - k/3 > k/30. 

Also \Dr\ = \Br\ + l^l - \N^{S)\ > k/30 - 2ck > 0. Since e/j(Z)/?) = 0, for any x€ Drwb 
have \Nr{x)\ < \Ar\ + l^^l + \Cr\ = \N+{S)\ + \Gr\. Thus 

11 

\Cr\ > 2(3/8 - 2d)k - \N+{S)\ > 2(3/8 - 2d)k - (2/3 + 2c)k > 

and also \Ar\ = \Gr\ + \N+{S)\ + |5| - A: > 2(3/8 - 2d)k + \S\-k>0. 

Pick a vertex ur £ Dr whose degree in R is minimal, a vertex vr £ Ar whose outdegree in 
R is minimal and a vertex wr G Cr of whose indegree in R is minimal. Since the minima are 
at most the averages, inequality ([2]) implies that 2(3/8 — 2d)k < d{uR) < \Ar\ + \Br\ + \Cr\, 
(3/8 - 2d)k < d+{vR) < \Ar\/2 + \Br\ and (3/8 - 2d)k < d'iwR) < \Br\ + \Cr\/2. We also 
have the inequality \Br\ - \Dr\ = \N+{S)\ - \S\ < 2ck. Thus we may define positive reals 
TA, TB, rc, ro by 
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. rA := \AR\/2 + \BR\-{3/8-2d)k, 

• TB ■.= l{\DR\-\BR\+2ck), 

. rc ■.= \BR\ + \CR\/2-{3/8-2d)k, 

. rn := \Ar\ + \Br\ + \Cr\ - 2(3/8 - 2d)k. 

Then 

rA + rB + rc + rD = ^{\Ar\ + \Br\ + \Cr\ + \Dr\) + 3ck - 4(3/8 - 2d)k = {3c + 8d)k < ick. 

This in turn imphes that 

• \Dr\ = k- {\Ar\ + \Br\ + \Cr\) = k- 2(3/8 - 2d)k -rD = k/4 ± 5ck, 

• \Br\ = \Dr\ + 2ck - IrB = k/4 ± lOcfc, 

• \Ar\ = 2((3/8 - 2d)k - \Br\ + r^) = k/4 ± 30cfc and 

• \Cr\= 2((3/8 - 2d)k - \Br\ + rc) = k/4 ± 30c/c. 

Altogether, this gives \A\,\B\,\C\,\D\ = (1/4 ± 31c)n. □ 

Claim 2.2. 

• e{A) > (l/2-0(c))nVl6, 

• e{A,B) > (l-0(c))nVl6, 

• e(B,C) > (1 -0(c))nVl6, 

• e(S,L>) > (l/2-0(c))nVl6, 

• e(C) > (1/2 - 0(c))nVl6, 

• e(C,L') > (l-0(c))nVl6, 

• 6(1?,^) > (l-0(c))nVl6, 

• e{D,B) > (l/2-0(c))nVl6. 

Proo/. Since e{A,C),e{A,D) < 3dn^ by ([6]) we have 

^ (i+(x) < \A\'^/2 + l^ll^l + 6dn^ = (3/2 + 0(c))nVl6. 

xGA 

On the other hand, Y.xeAd'^i^) > \A\{3n - 4)/8 = (3/2 - 0{c))n'^ /16. So we must have 
e{A) > (1/2 - 0(c))nVl6 and e(^,B) > (1 - 0(c))nVl6. Also, since e{A,C),e{D,C) < 
3dn^ we have 

(3/2 - 0(c))nVl6 < ^"(^) < l^ll*^! + l'^lV2 + edn^ = (3/2 + 0{c))n^ /16, 

so e(C) > (l/2-0(c))nVl6 and e(B,C) > {1 - 0{c))n^ / 16. Next, writing ;D := >1USUC 
and using the inequalities e{D), e{D, C), e{A, D) < 3dn'^ gives 

(3- 0(c))nVl6 < e{D,D) + e(D,D) + 2e{D) 

< e{D, A) + e{D, B) + e{B, D) + e(C, L>) + 12(in2 

< \D\{\A\ + \B\ + |C|) + 12dn2 = (3 + 0(c))nVl6, 

soe(Z),A) > (l-0(c))nVl6 ande(C7,D) > (1 - 0(c))nVl6. Finally, since e(D, C), e(D) < 
3(in^ we have 

(3/2 - 0(c))nVl6 < d+{x) < \ A\ \D\ + e{D, B) + Gdn^ < (1 + 0(c))nVl6 + e(L», 

xeD 

and so e(i:),B) > (1/2 - 0(c))nVl6. Since e(yl, e(Z)) < 3dn'^ we have 
(3/2-0(c))nVl6 < Yd-{x) < e{B , D) + \C\\D\ + 6dn^ < {I + 0{c))n'^ / 16 + e{B, D) 

x&D 
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and so e{B, D) > (1/2 - 0(c))nVl6. □ 

Henceforth we will only use Claims 2.1 and 2.2 and make no further use of the information 
in ([6|). This has the advantage of making our picture invariant under the relabelling A C, 
B ^ D. Also, we add Vq to A. Since |Vo| < (e^/3)n and e <C c, this does not affect the 
assertions of Claims 2.1 and 2.2 (except as usual the constant in the O(c)-notation). It will 
sometimes be convenient to use the notation A := P{1), B := P(2), C := P(3), D := P(4). 

Given a vertex x £ G, we will use the compact notation x has property W : AlB*C*Dl 
as follows. The notation starts with some W G {A, B, C, D}, namely the set that x belongs 
to. Next, for each of A, B, C, D its superscript symbol refers to the intersection with N~^{x) 
and its subscript to the intersection with N~{x). A symbol ' >a' describes an intersection 
of size at least (a — 0{y/c))n/ A and a symbol ' < a' describes an intersection of size at 
most (a + 0(y^))n/4. The absence of a symbol means that no statement is made about 
that particular intersection, and we can omit any of A,B,C,D if it has no superscript or 
subscript. For example, to say x has property BiA-^^C^y^ means that x G |iV+(x)nA| > 
(1 - 0(^))n/4, \N+{x) nC\< (1/2 + 0{^/c))n/4 and \N-{x) nC\< (1/3 + 0(^))n/4. 

We say that a vertex x is cyclic if it satisfies 

for some i < 4 (counting modulo 4). Claim 2.2 implies that at most 0{^/c)n vertices are 
not cyclic. We say that a vertex is acceptable if it has one of the following properties 

./i>i/i00r) , A- A , R>i/ioo 4.4>V100 



A:B>y^^^D^y,,,^A 



B:A^y,ooC>'/''^B 
C:B^,^,,oD>'/''',C 
D:A>y^^^C^,/,,o,D 



f> r'>i/ioo ri-n n>i/ioo ^.^>i/ioo 

-D>1/100'-^ 7 •'-^>i/ioo-f^ ' •'-">1/100' 

4>i/ioor , r).R>i/ioo^ , n-R>^/^°° 



In other words, a vertex is acceptable if it has a significant outneighbourhood in one of its 
two out-classes and a significant inneighbourhood in one of its two in-classes, where 'out- 
classes' and 'in-classes' are to be understood with reference to the extremal oriented graph 
in Section [3l So for example A has out-classes A and B and in-classes A and D. We will 
also call an edge acceptable if it is of the type allowed in the extremal oriented graph (so for 
example an edge from A to is acceptable but an edge from i? to A is not). Note that a 
cyclic vertex is also acceptable. 

In what follows, we will carry out 0{y/c)n reassignments of vertices between the sets A, 
B, C and D (and a similar number of 'path contractions' as well). After each sequence of 
such reassignments it is understood that the hidden constant in the 0(-y/c)-notation of the 
definition of an acceptable/cyclic vertex is increased. 

Claim 2.3. By reassigning vertices that are not cyclic to A, B, C or D we can arrange 
that every vertex of G is acceptable. We can also arrange that there is no vertex that is not 
cyclic but would become so if it was reassigned. 

Proof. We start by making any reassignments necessary to satisfy the second statement of 
the claim. (To do this, we reassign all the vertices in question in one step. As we are 
reassigning 0{y/c)n vertices we can then change the hidden constant in the 0(-y/c)-notation 
to make sure that the vertices which were cyclic before are still cyclic.) 

To satisfy the first statement of the claim, for any vertex x € G we let P^ := {1 < i < 4 : 
\N+{x)nP{{}\ > n/400}, P- ■.= {l<i<i: \N-{x)nP{i}\ > n/400}, and P^ := P+UP'. 
By the minimum semi-degree condition |P^| > 2, \P^\ > 2 and \Px\ > 3. If there is some i 
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such that i + 1 £ P+ and i — 1 £ P~ (where we use addition and subtraction mod 4) then 
we can put x into P{i) and it will have property P{i) ■.P{i — l)>i/ioo^(^ + 1)-*^/^'''^, i.e. x 
will become acceptable. Otherwise we must have either P+ = {1,3} and P^ = {2,4}, or 
P,jr = {1,3} and P^ = {2,4}. In either case we can put x into A = P(l): in the first case 
it will have property A:A-^^^^'^^D~^i/iqq and in the second case property A: Ayi/iQQB^^^^^'^ . 
So in both cases x will become acceptable. As before, by increasing the hidden constant in 
the 0(y^)-notation if necessary, we can ensure that the properties of all other vertices are 
maintained. □ 



In the proof of the following claims, contractions of paths will play an important role. 
Given a path P in G whose initial vertex pi and final vertex p2 lie in the same class P{i), 
the contraction of P yields the following oriented graph H: we add a new vertex p to the 
class P{i) and remove (the vertices of) the path P from G. N^{p) will be N~^(j)2) H P{i + 1) 
and Njj{p) will be N~ {pi)r\P{i — 1). Note that any cycle in H which includes p corresponds 
to a cycle in G which includes P. The paths P that we use in the proof of Claim 2.4 will 
have initial and final vertices in the same class and will be acceptable, meaning that every 
edge on P is acceptable. Note that each such acceptable path P must be BD-balanced, 
meaning that if we delete the initial vertex of P we are left with a path that meets B and 
D the same number of times. This may be seen from the observations that visits of P to 
B U D alternate between B and D, and if the path is in A and then leaves it must visit B 
and D an equal number of times before returning to A (and similarly for C). Note that if 
we have \B\ = \D\ and contract a SD-balanced path, then the resulting digraph will still 
satisfy \B\ = \D\. The 'moreover' part of the following claim is used later in the proof to 
turn a graph with = |D| + 1 into one with \B\ = \D\ under certain circumstances. 

Claim 2.4. // \B\ = \D\ and every vertex is acceptable then G has a Hamilton cycle. 
Moreover, the assertion also holds under the following slightly weaker assumption: \B\ = \D\ 
and there exists some vertex x such that every vertex except x is acceptable, there is at least 
one acceptable edge going into x and at least one acceptable edge coming out of x. 

Proof. We will use the 'standard' version of the Blow-up Lemma (Lemma [5]) to prove the 
first assertion. For this, the idea is to first find suitable paths which together contain all 
the non-cyclic vertices. We will contract these paths into vertices so that the resulting 
oriented graph Gi consists entirely of cyclic vertices. Then we will find suitable paths whose 
contraction results in an oriented graph G2 which satisfies |^| = \B\ = \C\ = \D\ and all of 
whose vertices are cyclic. We can then apply the Blow-up Lemma to its underlying graph 
to find a directed Hamilton cycle in G2 which 'winds around' A, B, G, D. The choice of our 
paths will then imply that this Hamilton cycle corresponds to one in G. 

Let vi,. . . ,vt be the vertices which are acceptable but not cyclic. For each Vi, choose a 
cyclic outneighbour vf and a cyclic inneighbour v~ so that all of these vertices are distinct 
and so that the edges Vivf and v^Vi are acceptable. Note that this can be done since 
t = 0{y/c)n. Let P/ be a path of length at most 3 starting at vf and ending at a cyclic 
vertex which lies in the same class as v~ , and where the successive vertices lie in successive 
classes, i.e. the successor of a vertex x S V{P) n P{i) lies in P{i + 1). (So if for example 
Vi has the first of the acceptable properties of a vertex in A, then we can choose vf G B, 
v^ G D and so P/ would have its final vertex in D. Also, if v~ and vf lie in the same class 
then P/ consists of the single vertex vf .) Again, the paths P/ can be chosen to be disjoint. 
Let Pi := v~VivfPf Then the Pj are acceptable and thus PD-balanced. Let Gi be the 
oriented graph obtained from G by contracting the paths Pj. Then every vertex of G\ is 
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cyclic (by changing the constant involved in the 0{^/c) notation in the definition of a cyclic 
vertex if necessary). Moreover, the sets A,B,C,D in Gi still satisfy \B\ = \D\ and we still 
have that the sizes of the other pairs of sets differ by at most 0{^/c)n. 

Now suppose that \A\ < \C\ and let s := \C\ — \A\. Greedily find a path Pq of the form 

CCDAB . . . CCDAB C 

^ V ' 

s times 

consisting entirely of cyclic vertices. So Pc starts with an edge between two cyclic vertices 
in C. (Claim 2.2 implies that almost all (unordered) pairs of (cyclic) vertices in C are joined 
by an edge.) Then the path Pc uses one cyclic vertex in D, one in A etc. Let G2 be 
the digraph obtained by contracting Pc- Then in G2, we have \A\ = \C\ and \B\ = \D\. 
If |j4| > \C\ we can achieve equality in the similar way by contracting a path Pa of the 
form AABCD . . . AABCDA. Note that since s = 0(y^)n, all vertices of G2 are still cyclic. 
Now suppose that in G2 we have \B\ > \A\. Let s := \B\ — \A\. This time, we greedily find 
a path Pb of the form 

BDABCDBCDA . . . BDABCDBCDAB 

^ V ' 

s times 

consisting of cyclic vertices. To see that such a path exists, note that Claim 2.2 implies that 
there are at least |-B|/4 (say) cyclic vertices in B with at least |-D|/4 cyclic outneighbours 
in D. So Pb starts in such a cyclic vertex in B and then uses cyclic vertices in D, A, 
B and C. Since there are at least |-D|/4 cyclic vertices in D having at least |-B|/4 cyclic 
outneighbours in B, Pc can then move to such a cyclic vertex in D and use cyclic vertices 
in B, C, D and A etc. Note that Pb is -BD-balanced. By contracting Pb, we obtain an 
oriented graph (which we still call G2) with \A\ = \B\ = \C\ = \D\ and all of whose vertices 
are still cyclic. Finally, suppose that in G2 we have \B\ < \A\. In this case we can equalize 
the sets by contracting two paths Pa and Pc as above. 

Thus we have arranged that \A\ = \B\ = \C\ = \D\ in G2. Let G2 be the underlying 
graph corresponding to the set of edges oriented from P{i) to P{i + 1), for 1 < "i < 4. 
Since all vertices of G2 are cyclic and we chose c <C <C 1, each pair {P{i),P{i + 1)) is 
(77, l)-super-regular in G'2- Also, G'2 is simple, i.e. there are no multiple edges. Let F' be 
the 4-partite graph with vertex classes A = P{1), B = P{2), C = P{3), D = ^(4) where the 
4 bipartite graphs induced by {P{i),P{i + 1)) are all complete. Clearly F' has a Hamilton 
cycle, so we can apply Lemma [5] with A; = 4, A = 2 to find a Hamilton cycle Cnam in G'2- 
The 'moreover' part of Lemma [5] implies that we can assume that Cnam continually 'winds 
around' A, B, C, D, i.e. one neighbour on Cnam of a vertex x G P{i) lies in P{i + 1) and the 
other in P{i — 1). Cnam corresponds to a directed Hamilton cycle in G2 and thus in turn to 
a directed Hamilton cycle in G. 

Now, we deduce the 'moreover' part from the first part of Claim 2.4. Similarly as in the 
first part, the approach is to find a suitable path P containing x which we can contract into 
a single vertex so that the resulting oriented graph still satisfies \B\ = \D\ and now all of its 
vertices are acceptable. Choose x~ and x+ so that x~x and xx'^ are acceptable edges. Since 
x~ is acceptable, it has a cyclic inneighbour x so that the edge x x~ is acceptable. Let 
P{i) be the class which contains x . Let P' be a path of length at most 3 starting at x'^ 
and ending at an cyclic vertex in P{i) so that successive vertices lie in successive classes. Let 
P := x x~xx^P' . Then P is acceptable and thus SD-balanced. Let H be the oriented 
graph obtained from G by contracting P. Then in H we still have \B\ = \D\. All vertices 
that were previously acceptable/cyclic are still so (possibly with a larger error term 0{y/c)). 
Since x and the terminal vertex of P are both cyclic (and thus acceptable), this means 
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that the new vertex resulting from the contraction of P is still acceptable. So we can apply 
the first part of the claim to obtain a Hamilton cycle in H which clearly corresponds to one 
in the original oriented graph G. □ 

The picture is still invariant under the relabelling A <^ C, B <^ D, so we may assume 
that \B\ > \D\. Since we are assuming there is no Hamilton cycle, Claim 2.4 gives \B\ > \D\. 

Claim 2.5. For each of the properties A:C>i/ioo, ^:^>i/ioo, C:A>Vioo^ C:5>Vioo^ ^j^^^^ 
are less than \B\ — \D\ vertices with that property. 

Proof. Suppose to the contrary that for example we can find — ID] =: t vertices vi,... ,vt 
in A having the property A : C>i/ioo- Select distinct cyclic vertices . . in C with 
€ iV~(t;j) for 1 < i <t. This can be achieved greedily, as \B\ — \D\ = 0{y/c)n < n/1000, 
say. Since vi, . . . ,vt are acceptable vertices in A we can greedily select distinct cyclic vertices 
G Ar+(i!j) n U B). For each i such that vfeB let bf := vf. For each i such that 
6 A select bf to be a cyclic vertex in N~^{vf) H B; we can ensure that bi,...,bf are 
distinct. Similarly we can select cyclic vertices b~ G N^{v~) D B which are distinct from 
each other and from b^,. . . ,b^. Thus we have constructed t = \B\ — \D\ vertex disjoint 
paths Pi,. . . ,Pt where Pi starts at b~ in B, goes through C, then it uses 1 or 2 vertices of A, 
and it ends at 6^ in B. Consider a new oriented graph H, obtained from G by contracting 
these paths. Note that the vertices of H correponding to the paths Pi are acceptable and 
the analogues of B and D in H now have equal size. So we can apply Claim 2.4 to find 
a Hamilton cycle in H. This corresponds to a Hamilton cycle in G, which contradicts our 
assumption. Therefore there are less than \B\ — \D\ vertices with property ^:C>i/ioo- 

The statement for property C : ^>i/ioo follows in a similar way, as we can again start 
by finding a matching of size \B\ — \D\ consisting of edges directed from C to A. The 
arguments for the other two properties are also similar. For instance, if we have \B\ — \D\ 
vertices vi, . . . ,Vt in A having property A:Byi/iQQ we can find a matching b^vi, . . . ,b'j~Vt of 
edges directed from B to A such that the b^ are cyclic. We can extend this to — vertex 
disjoint paths Pi, . . . ,Pt, where Pi starts with the edge b^Vi, it then goes either directly to 
a cyclic vertex in B or it uses one more vertex from A before it ends at a cyclic vertex in B. 
Now we construct a new oriented graph H similarly as before and find a Hamilton cycle in 
H and then in G. □ 

We now say that a vertex is good if it is acceptable, and also has one of the properties 

^:-B<i/iooC<i/ioO) -S:^^'^'''^°°-B<i/ioo^<i/ioO) C-.A^^^^^^B^^^^^^ or D: 

(The last option means that every acceptable vertex in D is automatically good.) Note that 
a cyclic vertex is not necessarily good. 

Claim 2.6. By reassigning at most 0{-s/c)n vertices we can arrange that every vertex is 
good. 

Proof. While \B\ > \D\ we reassign vertices that are bad (i.e. not good) as follows. Suppose 
there is a bad vertex x e B U C with |7V+(x) n A\ > n/400. If we also have \N^{x) n 
C\ > n/400 or \N~{x) Ci B\ > n/400, then by reassigning re to L> it will become a good 
vertex. (Recall that for a vertex in D, being acceptable and good is the same.) If not, 
then \N~{x) n C\,\N~{x) Ci B\ < n/400, and so \N~{x) n ^| > n/400. In this case we can 
make x good by reassigning it to A. Exactly the same argument works if there is a bad 
vertex x e BUC with |A^"'"(a;) Ci B\ > n/400. If x was in B we have decreased \B\ — \D\ by 
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1 or 2, whereas by Claim 2.5 we will make at most 0{-\/c)n such reassignments of vertices 
X G C. 

Similarly, suppose there is a bad vertex x & A\JB with \N~{x) f\B\> n/400 or \N~{x) D 
C\ > n/400. If wc also have \N+{x) D A\ > n/400 or \N+{x) H B\ > n/400, then by 
reassigning x to I? it becomes good. If not, then \N~^(x) Ci A\, \N~^(x) D B\ < n/400, and so 
\N^{x)nC\,\N^{x)r]D\ > n/400. In this case we can reassign x to C. Again, if x was in B 
we have decreased \B\ — \D\ by 1 or 2, whereas by Claim 2.5 we will make at most 0{^/c)n 
such reassignments of vertices x G A. 

The above cases cover all possibilities of a vertex being bad. If during this process \B\ 
and \D\ become equal we can apply Claim 2.4 to find a Hamilton cycle. Alternatively, it 
may happen that \B\ — \D\ goes from +1 to —1 if a vertex x is moved from B to D. In this 
case wc claim that we can put x into A or C to achieve the following property: every vertex 
except X is acceptable, there is at least one acceptable edge going into x and at least one 
acceptable edge coming out of x. 

To see this, suppose first that one of N~{x) n A or N~{x) n D is non-empty. If one of 
iV+(x) n A or iV+(x) n i? is also non-empty we can put x into A. Otherwise A''"'"(x) n C and 
iV+ (x) n D are both non-empty. Now we can put x into C unless N~ (x) fl B and N~ (x) fl C 
are both empty, which cannot happen, as then x would not have been a bad vertex of B. 
Now suppose that N^{x) n A and N~{x) f] D are both empty, so N~{x) f) B and N~{x) D C 
are both non-empty. We can put x into C unless A^"'"(x) n C and A^+(x) n D are both empty. 
But this implies that x satisfies the property B : A^^B^y^C^^i. This cannot happen, as 
then X would have been reassigned to D as a cyclic vertex in Claim 2.3. Therefore we can 
apply the 'moreover' part of Claim 2.4 to find a Hamilton cycle. But since we are assuming 
there is no Hamilton cycle the process must terminate with an assignment of vertices where 
all vertices are good. □ 

Let M be a maximum matching consisting of edges in E{B, A)l}E{B)UE{C, A)DE{C, B). 
Say that M n E{B,A) matches Ba C B with Ab C A, that M n E{B) is a matching on 
Bb C B, that M n E{C, A) matches Ca C C with Ac C A and that M n E{C, B) matches 
CbCC with Be C B. Note that e(M) = \Ab\ + \Ac\ + \Bb\/2 + \Bc\. 

We must have e(M) < \B\ — \D\. Otherwise, by a similar argument to that in Claim 2.5 
we could extend t := \B\ — \D\ edges v^Vi of M to \B\ — \D\ vertex disjoint paths Pi, . . . ,Pt, 
where Pi includes v^Vi, starts and ends at cyclic vertices b^, bf in B, and uses two more 
vertices from B than from D. Then we would find a Hamilton cycle as in Claim 2.5, which 
would be a contradiction. 

Claim 2.7. e(M) = and \B\ - \D\ = 1. 

Proof. We will first prove that e(M) = 0. Assume to the contrary that e(M) > 1. Let 

A' := A\{AbUAc), B' := B\{Ba^Bb^Bc) and C := C\{Ca^Cb)- By the maximality 
of M there are no edges from B' U C to A' or from B' U C to B'. Since all vertices are good 
this implies 

e{C,A)<eiCAUCB,A) + e{C,ABUAc) < U Cs| + |Ab U Ac|)(l/100 + 0(V^)n/4 
(7) < 2e(M)(n/4)/99 < e{M)\A\/A9. 

Similarly e{B,A) < e(M)|^|/49. So by considering the average indegree of the vertices in 
A we can find a vertex a e A such that 

d-ia) < \A\/2 + \D\ + 2e(M)/49. 
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Also, we can obtain e(C, B) < e(M)|C|/49 in a similar way as ([7|). So by averaging, we can 
find a vertex c £ C with 

d+{c) < \C\/2 + \D\ + 2e(M)/49. 
By the maximality of M, every edge in B is incident with a vertex of Bj^UBb^Bq- Together 
with the fact that every vertex is good, this implies that 

e{B) < \Ba UBbU Bc\ ■ 2(1/100 + 0(V^))n/4 < 2e{M)\B\/i9. 
So we can find h £ B with 

d{h) < \A\ + \C\ + \D\ + 4e(M)/49. 
Now we use the minimum semi-degree condition and the above inequalities to get 

(8) < d'{a) + d+{c) + d{b) < ^{\A\ + \C\ + 2\D\) + 8e(M)/49 

Substituting n = \A\ + \B\ + \C\ + \D\ and simplifying gives 

(9) \B\ - \D\ < (2/3)(2 + 8e(M)/49) = 4/3 + 16e(M)/147. 

On the other hand, we previously observed that \B\ — \D\ > e(M) + 1. Combining this gives 
131e(M)/147 < 1/3, which is only possible if e{M) = 0. Now ([9]) immediately implies the 
claim. □ 

Note that e(M) = implies that e{B U C,A) = 0. So by averaging, there is a vertex 
a € A with d~{a) < {\A\ - l)/2 + \D\. Since e(M) = also implies that e{C,B) = and 
e(i?) = there is vertex c £ C with d+(c) < (|C| - l)/2 + \D\ and a vertex b £ B with 
d{b) < \A\ + \C\ + \D\. So as in ([8]), using \D\ = \B\ - 1 we obtain that 

3t7 — 4 3 3 

< d-{a) + (i+(c) + d{b) < -{\A\ + Id + 2|D|) - 1 = -(n - 1) - 1, 

which is impossible. This contradiction to our initial assumption on G completes the proof. 

□ 

As remarked in the introduction, the proof can be modified to find a cycle of any given 
length £ through any given vertex v of G, where £ > say. We give an outline of 

the necessary modifications here, the details can be found in [llj. In Case 1, we find the 
cycle as follows: consider a random subset Q of £ — 1 vertices of G — v obtained by choosing 
{£ — 1)/|-R| vertices from each cluster Vi. With high probability, the oriented subgraph Gq 
induced by Q{J{v} also has i? as a reduced oriented graph (with slightly worse parameters). 
Thus Lemma [To] implies that Gq contains a Hamilton cycle. So suppose we are in Case 2. 
The only significant difference in the argument is in the application of the Blow-up Lemma 
in the proof of Claim 2.4. After contracting paths so that all vertices are now acceptable and 
1^1 = \B\ = \C\ = \D\, we then find a single path Pi which contains all the vertices which 
correspond to contracted paths and also contains v. We then apply the Blow-up Lemma to 
find a path P2 which (i) contains all but n — £ of the remaining vertices, (ii) whose initial 
vertex lies in the outneighbourhood of the final vertex of Pi and (iii) whose final vertex lies 
in the inneighbourhood of the initial vertex of Pi. Then Pi U P2 corresponds to a cycle of 
length £ through v in G. 
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